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ABSTRACT. Terrestrial arthropod surveys and inventories frequently suffer from undersampling bias; 
common species are over-represented and rare species may be missed entirely. This study compared a 
rapid (3 days) and intense inventory of spiders from one hectare of a mature beech forest (Feigns sylvaticus) 
in Hestehaven, Denmark, comprising 8,710 adult spiders of 66 species to a previous, much more thorough, 
bi-weekly survey of two years duration from the same site that comprised 42,273 spiders (adult and 
juvenile) of 141 species. Mon-parametric species richness estimators were used to assess the degree of 
undersampling bias in various data partitions. The current study used five experienced, four novice col¬ 
lectors, and five semi-quantitative collecting methods. Method and time of day strongly affected numbers 
of species and adults per sample. Collector experience affected numbers of species but not numbers of 
adults per sample. Despite the intensive collecting, number of adults per sample did not decrease over the 
course of the study. At the end of the sampling, 31 species were still rare in the sample (singletons or 
doubletons). Non-parametric richness estimators suggest that the actual richness of adult spiders in the 
study plot at this time of year and susceptible to the methods used was about 80 species. Species turnover 
between the two surveys (ca 23 years) was remarkably small: the two lists were 92% identical. The base¬ 
line study suggests that the rarity of 12 of the 31 rare species was artifactual (10 due to phenology, one 
to method, another to spatial edge effects). The rarity of the remainder is unexplained and by default is 
interpreted as undersampling bias. 
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Conservation and natural resource manage¬ 
ment decisions notoriously draw mainly on 
ecological information obtained from verte¬ 
brates and plants (Kremen et al. 1993; Pren- 
dergast et al. 1993). Environmental monitor¬ 
ing is much the same, even though it is widely 
recognized that patterns in vertebrates and 
plants do not exemplify the patterns in many 
other groups in the same habitat (Groom- 
bridge 1992; Stork & Sam ways 1995). The 
bias towards vertebrates and woody plants 
stems from a simple reason: surveys are usu¬ 
ally so short and resource-limited that only the 
best-known and least diverse groups can be 
adequately sampled. 

One consequence is that arthropods, the 
most diverse organisms in any terrestrial en¬ 


vironment (May 1978), often go unstudied 
(Longino 1994). The most basic data provided 
by traditional biodiversity assessments is the 
richness and relative abundance of species in 
a given area (May 1975; Taylor 1978) and on 
both counts arthropods present formidable 
challenges (Erwin 1983; McKarney 1999; No¬ 
votny & Basset 2000). Arthropods, however, 
may provide information not otherwise ob¬ 
tainable from traditional focal groups; infor¬ 
mation that may turn out to be crucial for long 
term management of existing natural resourc¬ 
es (Kremen et al. 1993). They are small in size 
(therefore abundant), short-lived (going 
through many generations within short time 
spans), diverse and often have limited distri¬ 
butions and strict environmental requirements 
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(e.g., many mites and small spiders live ex¬ 
clusively within a few square meters of soil 
on the forest door). In theory, they should 
map environmental diversity and track envi¬ 
ronmental changes more quickly and precisely 
than longer-lived, more flexible organisms 
such as vertebrates and plants. 

Collecting prodigious numbers of species 
and individuals is easy, but the proportion of 
the total available fauna represented in such 
collections is usually unknown. The omni¬ 
present high frequency of rare species instead 
suggests that arthropod communities in gen¬ 
eral are drastically under-sampled by conven¬ 
tional survey efforts, even large ones. To 
judge the real utility of arthropods for envi¬ 
ronmental assessment and monitoring, there¬ 
fore, we first need to be able to assess the 
thoroughness and efficiency of inventories and 
censuses themselves. These, in turn, require 
relatively fast, cheap, efficient and robust 
sampling protocols. Such methods have been 
proposed for spiders in tropical ecosystems by 
Coddington and co-workers (Coddington et al. 
1991) and tested in Cameroon, Tanzania 
(Sprensen et al. 2002), Madagascar, Bolivia 
(Coddington et ah 1991, 1996), Guyana, To¬ 
bago, southern USA (Coddington et al. 1996; 
Dobyns 1997), Slovenia (Kuntner & Baxter 
1997), Denmark (this study) and Greenland 
(Larsen & Rasmussen 1999). 

The optimal test of empirical and analytical 
inventory methods would be against a 
“known universe” in which the fauna is a) 
natural, b) diverse, and c) thoroughly known. 
For spiders such sites are few indeed. One 
possibility is the International Biological Proj¬ 
ect (IBP) site (Thamdrup et al. 1975) in Hes- 
tehaven, Denmark. The information available 
on spiders from this mature beech forest is 
unique because collecting was carried out bi¬ 
weekly for several years in the 1970’s with a 
large battery of ecological sampling methods. 
All material collected was identified to species 
and instar, including juveniles (Toft 1976). 
The accuracy of juvenile identifications is fre¬ 
quently, and perhaps justifiably, questioned. 
Toft (1976) argued that in his specific case the 
error rate was acceptably low because the so¬ 
matic morphology of many species was dis¬ 
tinct, the total diversity was low and the clar¬ 
ity of distinct phenologies resulting in 
unambiguous adults made retroactive identi¬ 
fication of juveniles feasible (see also Toft 


1983). Because even crude measures of the 
effect of ignoring juveniles are almost non¬ 
existent in the literature (but see Norris 1999) 
and because the question is intrinsically inter¬ 
esting, for the purposes of comparison to our 
data we accept the accuracy of Toft’s deter¬ 
minations. Data from identified juveniles re¬ 
veal community phenology patterns, and also 
can quantify the bias resulting from the prac¬ 
tical necessity of modern inventories to focus 
on adult animals only, whether in tropical in¬ 
ventories or less well understood temperate 
areas. The proportion of juveniles in a tropical 
sample seems remarkably constant at 60-70% 
(Coddington, Scharff, pers. obs; Russell- 
Smith & Stork 1995; Silva 1996). Tropical as¬ 
sessments to date have worked with adults 
only, because tropical spider faunas are so lit¬ 
tle known that identification of juveniles of 
other than ostentatious species is impossible. 
Even adults are often impossible to identify to 
anything but morphospecies in the tropics. 
The main objective of this study was to eval¬ 
uate in several ways this inventory design 
against a “known spider universe.” Of course, 
local faunas do change with time (in this case 
a 23 year hiatus) and stochastically due to dis¬ 
persal and local extinctions. Nevertheless, we 
know of no spider communities from climax 
communities as well known as that of Heste- 
haven. We wanted to test how well the method 
worked in a temperate forest community 
where a few species would be numerically 
dominant and wished to investigate the impact 
of extremely rapid surveys using many si¬ 
multaneous collectors (with a mix of profes¬ 
sional arachnologists and collectors with little 
or no collecting experience) on the fauna, and 
to learn how unavoidable factors such as 
method, day vs. night collecting and collector 
experience affected results. Finally we wished 
to calibrate richness statistics from an ex¬ 
tremely rapid and intense inventory against 
the known richness for the same season and 
against the total known annual spider fauna. 

METHODS 

Study site. —The study was carried out dur¬ 
ing 3 days, August 29-September 1, 1994, in 
the mixed coastal forest, Hestehaven (176 
hectares), about 25 km NNE of Arhus, Eastern 
Jutland, Denmark. The forest is approximately 
15 meters above sea level and surrounded by 
agricultural land. A one hectare sampling plot 
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(56°17.46'N, 10°28.50'E) was established 
within a 3-hectare climax stand of mature 
beech ( Fagus silvaticus L.). A map of the for¬ 
est including the location of the sampling plot 
and the distribution of vegetation is given in 
Rasmussen et al. (1982; fig. 2). The nearest 
stand of non-beech vegetation is located ap¬ 
proximately 75 meters from the sampling plot 
and consists of spruce. The distance from the 
sampling plot to the nearest agricultural area 
is 250 meters. The plot perimeter was marked 
with strings to ensure that all collecting only 
took place within the plot. Danish ecologists 
have intensively studied the arthropod fauna 
of this particular stand of beech in the period 
1969 to 1972 in connection with an interna¬ 
tional study of beech-wood ecosystems (Niel¬ 
sen 1974a, 1974b, 1974c, 1975, 1977, 1978a, 
1978b, 1987). The composition of the spider 
fauna was analyzed by Toft (1976, 1978) and 
other scientific results of this beech-wood pro¬ 
ject in Hestehaven have been published in 66 
scientific papers. There is no other place in 
Denmark, and few elsewhere in the world, 
where a well-known arthropod fauna has been 
studied in such detail. 

The oldest beech trees within the plot are 
more than 110 years old and very little regen¬ 
eration occurs. The density of beech trees is 
approximately 190 trees per hectare. Mature 
beech forests severely reduce light reaching 
the forest floor, and their root systems effec¬ 
tively compete against other woody plants. 
The Hestehaven forest floor vegetation is 
dominated by Anemone nemorosa L., Melica 
uniflora Retz., Asperula odorata L., Hordeum 
europaeum (L.), Circaea lutetiana L., Car ex 
sylvatica Huds., Veronica montana L., and Fi- 
caria verna Huds., (Nielsen 1977). At the end 
of August, the forest floor was dominated by 
knee-high grass and scattered areas with ferns. 
As is typical for mature northern European 
beech forests, the understory supported few 
bushes and small trees and therefore very little 
vegetation that could be reached by hand. 

Collectors. —Nine collectors worked si¬ 
multaneously in the field. Five of these were 
classified a priori as “experienced” (many 
years collecting spiders), and the other four as 
inexperienced (no or less than one year ex¬ 
perience in spider collecting). Sampling began 
on August 29 at night (2000-2400), continued 
day (0900-1800) and night on August 30-31, 
and concluded during the morning of Septem¬ 


ber 1. Even by “rapid” inventory standards 
three days is extremely short, but various 
scheduling conflicts prohibited a longer du¬ 
ration. Collecting both night and day is im¬ 
portant to make sure that both diurnal and 
nocturnal species are collected. Each collector 
was asked to use each collecting method a 
certain number of times during the fieldwork. 
Collectors were limited to 6 or fewer samples 
per day or night to avoid fatigue. One person 
kept track of all the samples taken by various 
collectors, methods and times of day, thereby 
ensuring that the different methods were used 
both day and night and making sure that work 
was carried out efficiently. 

Collecting methods. —We used five col¬ 
lecting methods to access the spider fauna 
within the plot. These were chosen to access 
as many different habitats, and to overlap as 
little as possible. Because the time span of the 
inventory was so short we did not use pitfalls 
and for logistic reasons Berlese or Tullgren 
funnel extraction of litter were not feasible. 
Each sample represented one method applied 
for 1 hour of active, continuous collecting 
(i.e., including time required to transfer the 
catch to a vial, but excluding time due to in¬ 
terruptions). Collectors used countdown time 
functions in wristwatches to time themselves. 
The countdown was suspended if the collector 
moved to a new habitat patch or if occupied 
with non-collecting tasks (i.e., logistics, 
equipment maintenance, field notes, photog¬ 
raphy, etc.). A sample therefore usually took 
somewhat more than one hour to finish. 

Aerial: Searching through the vegetation 
from knee height to as high as the collector 
can reach above his/her head. Toti et al. 
(2000) changed the name to “aerial” to em¬ 
phasize the target guild, but it is synonymous 
with “looking up” method of Coddington et 
al. (1991). 

Ground: Searching the ground and lower 
vegetation below knee height. Toti et al. 
(2000) changed the name to “ground” to em¬ 
phasize the target guild, but it is synonymous 
with “looking down” method of Coddington 
et al. (1991). The collector searches on hands 
and knees for spiders on the surface of plants, 
tree stems, logs, rocks, and the ground surface 
but not the interior of leaf litter, logs, under 
stones etc. 

Sweeping: Searching the lower herb layer 
with a sweep net (net diameter 36 cm). The 
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net was emptied after a few sweeps to avoid 
damage to the specimens. In this study the 
diversity of the vegetation available for 
sweeping was rather limited and dominated by 
grass and small, scattered areas with ferns. 

Beating: Sharply tapping branches or other 
vegetation with a stout stick while holding a 
0.6 rn 2 beating tray underneath to catch the 
falling spiders. Beating tray areas varied 
among collectors, but because samples were 
defined on the basis of time rather than repe¬ 
titions or area, beating tray areas are probably 
unimportant. Small spiders are efficiently 
transferred from the beating tray to the sample 
vial with an aspirator or pooter. Because ma¬ 
ture beech trees have very few lower branch¬ 
es, this plot had little vegetation suitable for 
beating, and consequently we allocated fewer 
resources to beating and more resources to 
other methods. Beating at night was difficult 
because of headlamp glare and yielded sparse 
results, so we eliminated that combination. 

Cryptic: Searching for adult spiders under 
logs, inside rotten logs, sifting litter, manual 
search within leaf litter, under rocks, inside 
holes, under bark, etc. It is intended to access 
any habitat the “cryptic” fauna is likely to 
occupy and allows the collector to use the 
method best suited to the opportunities the 
particular habitat offers. 

Specimens and sorting procedures.— 
Each sample was labeled with locality, date, 
collector, method, and replicate number (if 
two samples were otherwise identical). Sam¬ 
ples were more or less immediately trans¬ 
ferred to 70% ethanol in a WhirlPak® bag so 
that field vials could be reused. A mixture of 
experienced and inexperienced (students) ar- 
achnologists working in groups sorted the col¬ 
lection to species so that the experienced ar- 
achnologists could validate identifications 
(identifiers are listed in the Acknowledg¬ 
ments). All identifications of singletons and 
doubletons were checked and verified by sev¬ 
eral arachnologists. Voucher specimens of 
each species identified in this study are de¬ 
posited at the Zoological Museum, University 
of Copenhagen (ZMUC). Duplicates have 
been deposited at the Smithsonian Institution, 
Washington D.C. (USNM) and at the Califor¬ 
nia Academy of Sciences (CAS), San Fran¬ 
cisco, CA. 

Statistical analysis. —Statistical analyses 
and graphs were produced with Systat 9.0 


(SPSS Inc. 1999). To analyze the effects of 
inventory design parameters on results, we 
chose analysis of variance in which method, 
time of day, and collector experience were 
treated as independent factors, and numbers of 
adults and species per sample, respectively, as 
dependent variables. Post-hoc Tukey HSD 
tests were used to determine which treatments 
were responsible for significantly different 
factors. Due to the large number of factors and 
treatments, some ANOVA cells were empty. 
For example, we did not beat at night, and 
therefore beating was excluded from analyses 
involving method and time of day. A third 
analysis investigated the influence of individ¬ 
ual collectors on the overall mean number of 
species per sample. A fourth analysis contrast¬ 
ed the number of species per sample by meth¬ 
od and time of day for sets of experienced and 
inexperienced collectors. Species accumula¬ 
tion curves and richness estimates were pro¬ 
duced with Estimates 6.0b 1 (Colwell 2000). 
The current dataset is hereafter referred to as 
“ZMUC” (Zoological Museum University of 
Copenhagen) and the historical dataset from 
Toft (1976) is referred to as “AAU” (Aarhus 
University). 

Lognormal distributions were computed 
manually as no available programs retain the 
benefits of the classical approach and also 
solve the problem of the biased 0-1 octave 
(Lobo & Favila 1999; Longino et al. 2002). 
Many programs define abundance classes as 
log base 3, which prevents integer values from 
falling on class boundaries, but it also col¬ 
lapses the full distribution to relatively few 
abundance classes for most datasets, and the 
chi square test therefore lacks power. Log base 
2, as Preston (1948) originally suggested, 
maintains a relatively fine-grained classifica¬ 
tion of the data and is easy to compute. The 
problem of singleton species is more subtle. 
Most techniques (e.g. Preston 1948, 1962; 
Ludwig & Reynolds 1988; see also Bliss 
1966) apparently divide the singleton species 
between the 0.5—1 and 1-2 octave, just as oth¬ 
er values falling on class boundaries are di¬ 
vided. However, all higher octaves potentially 
receive from both neighboring boundaries, but 
the 0.5-1 octave cannot draw from the 0.25— 
0.5 octave, as species with fractional relative 
abundances are not observed. The practical ef¬ 
fect of this bias is that the 1-2 octave is al¬ 
ways larger than the 0.5-1 octave because it 
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contains half the l’s and 2’s. This produces a 
false mode in the data and distorts the calcu¬ 
lation of the lognormal parameters. Because 
the 0.5-1 octave is always biased, it should 
be ignored during the calculation of parame¬ 
ters. We iteratively assigned “octave” num¬ 
bers (r) to the log base 2 abundance classes 
and estimated the lognormal parameters SO 
(mode) and a (width) using the Nonlin module 
of Systat 5.2, with Quasi-Newton estimation 
and least squares fit (model: S = S 0 e ( a2 ' r2) . 
The optimal set of assignments minimized the 
chi square difference between estimated and 
observed richness (s) across octaves. 

“Sampling intensity” is the ratio of speci¬ 
mens to species (Coddington et al. 1996; 
Sprensen et al. 2002). The chief virtue of this 
measure is its simplicity: it can be calculated 
for any inventory whatever. Given roughly 
comparable relative abundance distributions 
and richness, it crudely compares sampling ef¬ 
fort to the size of the universe being sampled 
(but see Gotelli & Colwell (2001) for pitfalls). 
Inventory completion (or completeness) is the 
extent to which an inventory, or inventory 
component, samples the faunal partition avail¬ 
able to it (Sprensen et al. 2002). Equal sam¬ 
pling effort in microhabitats or diversity par¬ 
titions that vary in richness can result in 
disproportionately rich microhabitats being 
disproportionately undersampled. The usual 
symptom of such biased sampling is a strong 
correlation between sampling effort and rich¬ 
ness (Heyer et al. 1999), which in turn can 
bias conclusions about relative species rich¬ 
ness. For spiders, different methods and day 
versus night collecting access different parti¬ 
tions of the overall community with varying 
efficiency, and those partitions also differ in 
richness and abundance (Silva 1996; Silva & 
Coddington 1996; Coddington et al. 1996; 
Sdrensen et al. 2002). We measure “inventory 
completion” in an inventory partition as the 
ratio of observed richness to the Chaol rich¬ 
ness estimate for that partition (Sprensen et al. 
2002). Comparison of species richness esti¬ 
mators generally favor Chao2 as among the 
least biased, most efficient, and most robust 
methods (Colwell & Coddington 1994; Peter¬ 
son & Slade 1998; Walter & Martin 2001). 
Chao2, however, requires replicate sampling. 
Chaol performed nearly as well as Chao2 in 
tests, is simply calculated from tabular data, 
and is the only non-parametric richness esti¬ 


mator that does not require replicate sampling. 
It can therefore be applied to more kinds and 
qualities of inventory data, and will enable 
broader comparison of completion statistics 
across inventories. Our allocation of sampling 
effort reflected the idiosyncrasies of the site 
and our a priori assessment of the relative 
richness of different microhabitats. The dense 
beech canopy had suppressed nearly all un¬ 
derstory shrubs and the beech trees them¬ 
selves lacked all lower branches. The herb 
layer was knee-high uniform grass with inter¬ 
spersed fern clones. Therefore, we allocated 
relatively less effort to beating and more to 
cryptic and ground searching compared to ae¬ 
rial searching and sweeping. Ideally, an in¬ 
ventory should be an unbiased sample of the 
community. In practical terms this means that 
each method or time of day partition should 
reach the same degree of inventory comple¬ 
tion; equivalently, the coefficient of variation 
of inventory completion should be equal to or 
less than that of sampling effort investment 
across the variation inventory partitions. 

RESULTS 

The nine collectors produced 149 samples 
over 3 days containing a total of 8,710 adults 
of 66 species from the one hectare plot (Ap¬ 
pendix; Table 1). The mean number and stan¬ 
dard deviation of total samples per collector 
was 16.56 ± 0.72 (n = 149), aerial was 2.33 
± 0.71 (n = 21), beating was 0.89 ± 0.71 (n 
= 8), cryptic was 5.56 ± 1.59 (n = 50), 
ground was 5.22 ± 1.72 (n = 47) and sweep¬ 
ing was 2.56 ± 1.01 (/? = 23). Overall sample 
intensity (specimens : species) was 132, but it 
ranged from 24-110 per method (because 
methods often catch the same species, the to¬ 
tal sample intensity is usually greater than that 
of any partition). The figure of 132 may seem 
high, but is biased by the extraordinary abun¬ 
dance of two species (Linyphia triangularis 
(Clerck 1757) and Drapetisca socialis (Sun- 
devall 1833)). If these are excluded, the 
ZMUC sampling intensity falls to the medio¬ 
cre value of 12, which is well below 30, our 
current working guess of the minimum sam¬ 
pling intensity statistic typically sufficient to 
yield convincingly asymptotic richness esti¬ 
mates. Nineteen species were singletons and 
12 were doubletons. Despite the large number 
of animals collected, the final percentage of 
singletons was high at 29%. Linyphia trian- 


Table 1.—Summary values for the ZMUC inventory at Hestehaven. SD = Standard deviation, Spp = Species. Percent method bias is the deviation of the 
method’s inventory completion statistic from the grand mean among methods. Percent effort investment is the percent of total samples invested in a particular 
method. 
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gularis and D. socialis at 2,135 and 2,046 in¬ 
dividuals, respectively, dwarfed the abundanc¬ 
es of other species and accounted for 48% of 
the total inventory. The true relative abun¬ 
dance of at least D. socialis was even greater 
because we truncated collection of this species 
at 10 specimens per sample after the first 
night. We continued to collect L. triangularis 
because it could not be reliably distinguished 
in the field from the much rarer Linyphia hor- 
tensis Sundevall 1830 or Neriene clathrata 
(Sundevall 1830). Hourly samples averaged 
58 individuals and 9 species overall. Cryptic 
sampling yielded the fewest individuals per 
hour (19) and aerial the most (120), but meth¬ 
ods were remarkably uniform in average num¬ 
bers of species per sample (7-10). Richness 
per sample ranged from 2-14 species, and 
abundance from 2-273 individuals. 

Collector experience, method, and time 
of day. —Abundance but not number of spe¬ 
cies per sample required log-transformation 
prior to analysis to maintain normality. Col¬ 
lector experience, method, and time of day 
were treated as independent factors in the AN- 
OVA model, and numbers of adults and spe¬ 
cies per sample, respectively, as dependent 
variables. As mentioned under “Methods,” 
beating at night is difficult due to glare and at 
this site was unproductive. We therefore ex¬ 
cluded beating at night as a method-time of 
day combination and excluded it from these 
analyses. Collector experience significantly 
increased number of species per sample, but 
not number of adults (F = 7.029, P < 0.000, 
Fig. 1). Method affected both number of spe¬ 
cies ( F = 7.029, P < 0.000) and numbers of 
adults per sample (F = 20.429, P < 0.000, 
Fig. 2). Aerial and sweep sampling produced 
more adults per sample than cryptic (P < 
0.000 vs aerial; P < 0.000 vs. sweeping) or 
ground sampling (P < 0.026 vs aerial; P < 
0.003 vs. sweeping), and cryptic and ground 
also differed significantly from each other (P 
< 0.000). For numbers of species per sample, 
sweeping, ground and cryptic collecting did 
not differ from each other (Fig. 2), but aerial 
produced fewer species per sample than 
ground (P < 0.000) or sweeping (F < 0.006). 
Night collecting significantly increased num¬ 
bers of adults per sample (P < 0.009) but not 
species (Fig. 3). The model explained 75% of 
the variance in numbers of adults and 34% of 
the variance in numbers of species per sample. 


No factor interactions were significant in ei¬ 
ther ANOVA. To investigate more fully the 
effect of individual collectors on numbers of 
species per sample, we ran an ANOVA with 
collector identity and method as independent 
factors and numbers of species per sample as 
the dependent variable (Fig. 4). A post-hoc 
Tukey HSD test showed that collectors 5 and 
6 differed from 1 and 3, and collectors 7 and 
8 also differed from 3. Collector 5, classified 
a priori as experienced was more similar to 
inexperienced collectors (collector 5, Fig. 4). 
During the day experienced collectors were 
much more efficient at aerial searching and 
beating, less so at cryptic and ground search¬ 
ing, and indistinguishable from inexperienced 
collectors at sweeping (Fig. 5). At night in¬ 
experienced collectors were only slightly less 
efficient at aerial sampling and were equiva¬ 
lent sweepers, but the gap widened during 
cryptic and ground collecting (Fig. 6). Sweep¬ 
ing was the only method used here that 
seemed completely unaffected by experience. 
In summary, method strongly affected both 
abundance and richness, experience produced 
moderately larger numbers of species but not 
individuals and spiders were generally more 
accessible (active) at night than during the 
day. 

Complementarity of methods. —Thirty 
species were unique to single methods and the 
overlap between methods was moderate, rang¬ 
ing from 11 species shared between “ground” 
and “beating” to 22 species shared between 
cryptic and ground. Each method sampled 
unique species not found by the other methods 
(Table 1). 

Faunal depletion. —We tested for the ef¬ 
fect of intensive collecting on the overall spi¬ 
der fauna by plotting individuals per sample 
against chronologically arranged sample num¬ 
ber (Fig. 7). If all species are included, abun¬ 
dance does decrease over the sampling period 
(“All species,” Fig. 7). However, this de¬ 
crease is primarily due to our decision after 
the first night to truncate collection of the very 
abundant D. socialis in each sample after 10 
animals had been collected. If the two most 
common species are excluded ( D . socialis , L. 
triangularis ), spider abundance per sample 
did not decrease significantly over the course 
of the study (“most common excluded,” Fig. 
7). We further checked this result by lagging 
the data and testing for cross-correlation to the 


SCHARFF ET AL.—BEECH FOREST SPIDER RICHNESS 


253 


12 

10 

8 

6 

4 
2 

Experienced Inexperienced Aerial Cryptic Ground Sweep 

13 

11 

9 

7 

5 




Collector 




Figures 1—4.—Least squares means and standard errors from analysis of variance on number of species 
(open circles) and log of numbers of adults (closed circles) per: 1. Sample by collector experience. Ex¬ 
perience increases richness but not abundance per sample; 2. Sample by method. For abundance, cryptic 
and ground sampling differ from each other and both from aerial and sweeping. For richness, aerial differs 
from ground and sweeping; 3. Sample by time of day. Night collecting increases number of adults but 
not species; 4. Least squares means and standard errors from analysis of variance on number of species 
per sample by experienced (open circles) and inexperienced (closed circles) collectors. 


original time line; the plot showed no signif¬ 
icant trend. Collector fatigue and boredom 
with common species probably also played a 
role. 

Inventory completion. —The mean inven¬ 
tory completion by method was 71%, and sep¬ 


arate methods deviated —16% to +17% 
around this value (percent method bias, Table 
1). Figure 8 compares observed to estimated 
richness for each method, day versus night, 
and the total inventory. Aerial sampling was 
most complete at 88% and sweeping least 
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Figures 5-6.—Least squares means and standard errors from analysis of variance on number of species 
per sample by method and collector experience during: 5. Day; 6. Night (Abbreviations: AE = Aerial, 
BE = Beating, CR = Cryptic, GR = Ground, SW = Sweeping). 


complete at 55% despite essentially equal 
sampling effort. Day and night sampling, on 
the other hand, were equally complete at 72%, 
despite nearly twice as much investment in 


daytime sampling (Table 1). Overall, the co¬ 
efficient of variation for sampling effort across 
methods and times of day was 68%, but that 
for inventory completion only 18%, showing 
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Figure 7.—Number of adults per chronologically arranged sample for all the data and with the two 
most common species removed, with least squares linear fits to each sequence. 
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Method 


Figure 8.—Total species caught by each method and time of day, with Chaol estimates and inventory 
completion values for each partition. 


that differential investment does compensate 
for differential richness in habitats. Neverthe¬ 
less, sweeping and ground faunas appear to 
have been relatively undersampled and aerial 
and cryptic faunas relatively oversampled 
compared to the mean inventory completion, 
so that this particular allocation profile miti¬ 
gated, but did not eliminate bias due to dif¬ 
ferential return on effort by method. Richness 
plotted against individuals collected still 
shows positive slope and correlation (Fig. 9, 
“original”). If “return on effort” were satu¬ 
rated, the regression line would be essentially 
flat. Although the current inventory still 
shows a non-zero slope and correlation, less 
effort would have yielded an even steeper 
slope. Figure 9 also plots regressions for one 
third and two thirds of all samples randomly 
chosen from each method. One third as much 
effort shows a much steeper slope, and two 
thirds is intermediate between one third and 
the total data set, as expected (Fig. 9). Al¬ 
though substantial effort was invested in this 
inventory, it was nevertheless insufficient to 


eliminate correlation between sampling effort 
and observed richness. 

Richness estimation. —The rank-abun¬ 
dance plot for the 66 observed species shows 
a characteristically temperate faunal distribu¬ 
tion with relatively many common and few 
rare species compared to tropical faunas (Fig. 
10). The ZMUC data fit a lognormal distri¬ 
bution (chi square goodness of fit, P < 0.7) 
but show no mode (Fig. 11, “ZMUC”). The 
richness estimation curves show typical signs 
of an incomplete inventory (Fig. 12): the ob¬ 
served curve terminates substantially below 
the estimator curves and is not asymptotic, the 
estimators are not consistently asymptotic, the 
uniques curve is still rising or barely flat, lies 
relatively far above the doubletons curve, and 
shows no sign of crossing it, and the double¬ 
tons curve is definitely still rising. At face val¬ 
ue, the richness estimators presented here im¬ 
ply about 80-90 species present as adults in 
the area sampled and accessible to the meth¬ 
ods used, of which we observed only 66 
(73%). 
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Figure 9.—The correlation between richness and sampling effort by method (measured as number of 
individuals sampled) for 33%, 66% and the total (original) dataset, with least squares fits to each partition. 



Rank 


Figure 10.—Rank abundance diagram for the to¬ 
tal dataset. 


Comparison to AAU study. —S0ren Toft 
sampled the same hectare as well as the sur¬ 
rounding beech forest on a roughly biweekly 
schedule from July 1969-July 1971 using five 
methods: litter extraction, pitfall and stem 
traps, sweeping and “clubs”: a method in 
which trees are struck with very large clubs 
to dislodge the canopy fauna (Nielsen 1975; 
Toft 1976). He obtained 43,580 spiders of 147 
species (plus 3 species that he could only as¬ 
sign to genus) over the two year study and 
classified them all to species and, if juvenile, 
to instar. The original AAU data sheets still 
exist and we used them to compile a database 
of species by instar, abundance and sample 
characteristics (i.e. date, method, etc.). Not 
surprisingly after 23 years, some discrepan¬ 
cies could not be resolved, but the database 
eventually accounted for 42,273 animals of 
141 species, comprising 15,533 adults and 
26,740 juveniles (Table 2). The missing spe- 
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Figure 11.—Lognormal fits for the ZMUC and AAU data alone and combined. 


cies and animals were mainly extremely 
small, unidentifiable juveniles that we exclud¬ 
ed from the data. This unparalleled arachno- 
logical data set offers a unique opportunity to 
evaluate critically the more rapid and certainly 
less thorough ZMUC inventory at the same 
site. When pooled, the AAU 1969 and 1970 


August and September collections total 2,260 
adults. August alone comprised 47 species (11 
singletons, 7 doubletons) and September 49 
species (10 singletons, 12 doubletons); togeth¬ 
er the list comprised 57 species (16 singletons, 
13 doubletons). Considering that for these two 
months Toft collected only about one fourth 



Number of individuals 


Figure 12.—Curves for observed richness, rare species, and richness estimators for the ZMUC inventory 
against sampling effort. 
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Number of Individuals 

Figure 13.—Curves for observed richness, rare species, and richness estimators for the ZMUC and 
AAU data for the four week period centered on the ZMUC inventory. 


as many adults as the ZMUC total of 8,710, 
his results are certainly comparable to our 66 
species (19 singletons and 12 doubletons). 
The AAU August and September adult data 
contained 13 species not found in the ZMUC 
data, all but two singletons or doubletons, 
whereas the ZMUC study found 22 species 
not found by AAU for August-September, all 
but three singletons or doubletons. If the 
ZMUC list is compared to the total, annual 
AAU list, juveniles included, only five ZMUC 
species are missing from the AAU dataset. 

The full two years of AAU data (including 
juveniles) considered separately and together 
with the ZMUC data yield a more complete 
lognormal distribution: the AAU data alone 
do show a mode one octave to the right of the 
ZMUC maximum (chi square goodness of fit, 
P < 0.975), and the two datasets combined 
(chi square goodness of fit, P < 0.9) place the 
mode even further to the right (Fig. 11). 

The “best” estimate of the instantaneous 
richness during the ZMUC inventory is pre¬ 
sumably that based on the maximum data 
available for the seasonal period and the meth¬ 
ods used. The ZMUC inventory used neither 
pitfall traps nor “clubs” but sweeping and lit¬ 
ter sifting were common to both studies, and 
stem traps are quite similar to aerial searching. 


Excluding the latter methods and taking into 
consideration annual seasonal variation, we 
selected all adults collected two weeks before 
and after the ZMUC sampling dates as the 
most complete data set for this time period 
(thus adding 1969-1970 AAU data to the 
ZMUC study) and calculated richness esti¬ 
mates using this dataset totaling 9,871 adults. 
Figure 13 shows these curves. The estimates 
appear substantially better than the ZMUC 
data alone: the estimator and observed curves 
are more asymptotic and closer together; the 
singleton and doubleton curves actually cross. 
The parametric richness of the adult spider 
fauna is suggested to be around 80 species. 

The observed and Chaol estimated species 
richness calculated for each month of the 
AAU dataset show the summer peak expected 
in a north temperate fauna (Fig. 14). August- 
September is substantially past the annual 
May-June richness peak, judged either by ob¬ 
served or estimated richness. Comparison of 
estimated to observed richness for the AAU 
study shows that the level of sampling effort 
was relatively better early in the year, insuf¬ 
ficient to keep up with the May-June peak, 
recovered somewhat in August-September, 
and fell off again in October-December. 
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Months 

Figure 14.—Total species observed (adults only and including juveniles) for each month of the AAU 
inventory, with Chaol estimates based on both partitions. 


Monthly-observed adult richness varied from 
23 in January to 78 species in June. 

Despite the large number of animals col¬ 
lected, the ZMUC inventory still contained 19 
singletons and 12 doubletons. The null hy¬ 
pothesis of richness estimation is that rare spe¬ 
cies indicate undersampling. However, there 
are at least three alternative explanations for 
“rare” species besides undersampling bias. 
Conceptually these are all “edge effects” due 
to time, method, or space (Longino et al. 
2002 ). 

Phenological edge effects. —A phonologi¬ 
cal edge effect is an individual that is mature 
outside the normal breeding season of its spe¬ 
cies. The AAU study aimed principally to re¬ 
construct the life history and phenologies of 
the spider community at this site. These data 
(Fig. 15) can be used to “diagnose” which of 
the rare ZMUC species are “phenological 
edge effects.” For example, 48 of the ZMUC 
species are normally adult at the time of the 
inventory, but 13 are typically adult at other 
times: 12 earlier and one later. We counted a 


species as a phenological edge effect (as op¬ 
posed to just being rare) if its total abundance 
in the AAU study was more than 10, and the 
time span of adults did not include August or 
September. Of the 19 singleton and 12 dou¬ 
bleton species in the ZMUC inventory, eight 
singletons ( Anyphaena accentuatci (Wale ken- 
aer 1802), Araniella curcubitina (Clerck 
1757), Hypomma cornutum (Blackwall 1833), 
Linyphia hortensis Sundevall 1830, Micrcir- 
gus herbigradus (Blackwall 1864), Neriene 
peltata (Wider 1834), Walckenaeria obtusa 
(Blackwall 1836), Pachygnatha listen Sun¬ 
devall 1830) and two doubletons (Diploce- 
phalus latifrons (O. P.-Cambridge 1863), 
Saaristoa abnormis (Blackwall 1841)) were 
out of season and arguably are not evidence 
of undersampling bias. 

Methodological edge effects. —A method 
edge effect is an individual of a species that 
typically inhabits a microhabitat not accessed 
by any of the methods used, or, at least, not 
efficiently accessed. If a singleton or double¬ 
ton ZMUC species was commonly collected 
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in the AAU study by a method not used in 
the ZMUC study (i.e. pitfalls or clubs), it is 
arguably a methodological edge effect and not 
evidence of undersampling. As above, if the 
total AAU abundance was more than ten and 
mainly caught via pitfalls or clubbing, we 
count it as a methodological edge effect. Ner- 
iene peltata , Achaearanea lunata , and espe¬ 
cially A. accentuata were all substantially 
more common in the canopy than in subcan¬ 
opy strata. Some also showed minor peaks in 
abundance in sweep samples, suggesting that 
rarely animals may fall or jump from the can¬ 
opy and so appear in the herb layer. Walcken - 
aeria ohtusa , M. herhigradus , P. listen , D . 
tihiale , Lepthyphantes cristatus , L. pallidus , S. 
abnormis , and D. latifrons were taken almost 
exclusively by pitfall traps, although the latter 
also appeared in litter samples. These 11 spe¬ 
cies are probably rare in the ZMUC study be¬ 
cause they are accessible mainly via methods 
we omitted, although five were also out of 
season. 

Spatial (habitat) edge effects. —A spatial 
edge effect is an individual of a species that 
prefers a habitat not present in the study area. 
The hectare was fairly uniform, but it had a 
wet depression at its lower end. Tetragnatha 
obtusa C.L. Koch 1837 may have been rare 
in both studies because it prefers wetter situ¬ 
ations and thus barely enters the plot. Argu¬ 
ably it is not evidence of undersampling. Al¬ 
though not present in the ZMUC study, 
Hyptiotes paradoxus (C.L. Koch 1834) was 
rare in the AAU study; it prefers the conifer¬ 
ous plantations adjacent to the study hectare 
and may have been sporadically and unrelia¬ 
bly present within study margins. Metellina 
merianae (Scopoli 1763), Erigone atra Black- 
wall 1833 and Nuctenea umbratica (Clerck 
1757) were rare in both studies, suggesting 
they may be typical of habitats other than ma¬ 
ture beech forest. No Hestehaven “rare” spe¬ 
cies are truly rare in Denmark. 

In sum, of the 19 singleton and 12 double¬ 
ton ZMUC species, nine-singletons and three 
doubletons are rare due to edge effects and 
should not be considered as evidence of un- 
dersarnpling. If these species are excluded 
from the inventory, and richness estimates re¬ 
calculated (Fig. 16), the quality of the inven¬ 
tory improves substantially. The estimator 
curves are definitely asymptotic (at about 
4,000 sample size), the observed curve still 


trails the estimator curve, and the uniques 
curve is almost flat, and the duplicates curve, 
unusually, goes to zero. 

DISCUSSION 

How many species of spiders typically in¬ 
habit one hectare of northern European climax 
beech forest? How much effort is required to 
answer the question or estimate that number, 
or how would one know when an observation 
or estimate was accurate? These questions 
make sense only if assumptions about tem¬ 
poral and spatial scales are made explicit. The 
minimum realistic spatial scale that is biolog¬ 
ically real is one large enough to include 
demes of all resident species, species-area ef¬ 
fects aside. Biparental organisms, in other 
words, should be present at least in abun¬ 
dances of two, and for all practical purposes 
many more. The latter reasoning provides a 
strong common sense justification for the 
Chao estimators of species richness, as they 
trade on the ratio of singletons (biological 
non-sequiturs) to doubles to correct for un¬ 
dersampling bias. Species literally present in 
a hectare as singletons don’t make biological 
sense because they can’t reproduce and must 
represent long-distance dispersal; doubletons, 
for all practical purposes, are the same. Of 
course, many animals live at spatial scales 
larger than a hectare, but for spiders, one hect¬ 
are (100 X 100 m) seems like a reasonable 
minimum spatial scale because it is unlikely 
that the breeding population structure of spi¬ 
ders, i.e. the “nearest-neighbor distance,” is 
so dispersed that single hectares are likely to 
contain one or fewer individuals. A spatial 
scale of one meter might be appropriate for 
litter fauna but inappropriate for large curso¬ 
rial hunters or web weavers. For the latter, as 
a guess, even 10 m seems excessive. At larger 
spatial scales the species-area effect will be 
increasingly important. The Danish national 
checklist currently stands at 500 species 
(Scharff 1984). Checklists overestimate cur¬ 
rent standing diversity because they are cu¬ 
mulative, and not corrected for faunal turn¬ 
over. Thus, the “instantaneous,” ecologically 
meaningful, richness of spider species in Den¬ 
mark is probably less. 

The “checklist” of Hestehaven listed by 
Toft (1976) includes 147 identified species. 
Twenty-three years later the ZMUC study 
added only five more (M. merianae (Scopoli 
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1763) , Larinioides patagiatus (Clerck 1757), 
Ozyptila praiicola (C.L.Koch 1837), Walcken- 
aeria atrotibialis (O. P.-Cambridge 1878), 
Xysticus ulmi (Hahn 1831)), all singletons or 
doubletons except L patagiatus (7 individu¬ 
als). The absence of faunal additions in 23 
years is impressive (we cannot comment on 
losses): 92% of the ZMUC species were 
shared with the AAU study. None of the spe¬ 
cies added by the ZMUC study were present 
at a relative abundance of more than 0.0008, 
which also suggests that the fauna is stable 
over time. 

“Instantaneous" Hestehaven richness is 
much less than 147 species, at least during 
August—September. The monthly adult rich¬ 
ness observed by Toft (1976) ranged from a 
January low of 23 to a June high of 78 (Fig. 
14) and averages 47 ± 18 (sd); the August- 
September values were 47 and 49. These fig¬ 
ures do not include species present as juve¬ 
niles, which comprised 63% of the spider 
community in Toft’s study. If juveniles are in¬ 
cluded, monthly richness varies from a Janu¬ 
ary low 7 of 29 to a May high of 99, and av¬ 
erages 64 ± 22 (sd); the August-September 
values were 65 and 69. Although the average 
AAU monthly sampling intensity for adults 
was only 24, the average monthly percent sin¬ 
gletons was 27%, essentially the same as in 
the much more intense 2.5 day ZMUC effort 
(29%). Although this small sampling effort 
seems to provide the same percent singletons 
as the much more intense ZMUC study, our 
experience is that even small decreases in per¬ 
cent singletons demand logarithmic increases 
in effort. All the AAU figures still suffer from 
undersampling bias. The best bias-corrected 
figures we have are the adult-only estimates 
for August-September provided by the com¬ 
bined AAU-ZMUC data, which is about 75- 
80 species (Fig. 13). This figure, then, is pre¬ 
dicted to be the ballpark adult spider richness 
a complete survey would find for this season 
in this forest using the methods of the ZMUC 
inventory. 

Chaol estimates of monthly richness from 
the AAU study, including juveniles, range 


from a January low of 37 to a June high of 
120; August and September values are 85 and 
79, respectively (Fig. 14). Because these fig¬ 
ures include juveniles, phenological edge ef¬ 
fects are minimized, in which case remaining 
possible biases are method and spatial edge 
effects. Hyptiotes paradoxus is perhaps the 
only undeniable example of the latter, a spe¬ 
cies which “should not have been" in the 
Hestehaven beech wood. The spectrum of 
methods used by Toft accessed all substrates 
used by spiders except the high canopy. Both 
the intense sampling of the ZMUC survey and 
the effort to identify juveniles by the AAU 
survey yield essentially the same estimates: 
the per-hectare August-September standing 
spider species richness at Hestehaven is prob¬ 
ably about 80 species. 

If the above is true, the Hestehaven check¬ 
list richness of 150 species at first seems par¬ 
adoxical. If the greatest monthly observed 
richness is only about 120 species, juveniles 
included (Fig. 14), where are the remaining 
30 species? The most obvious explanation is, 
again, undersampling bias in the AAU study. 
Even though Toft identified every animal col¬ 
lected to species, the substantial fraction of 
singletons in all AAU partitions argues that 
he missed quite a few species. However, two 
other explanations should be considered. First, 
the missing species may not be permanent 
year-round residents in the sampled hectare. 
This implies considerable flux of species such 
that the standing richness indeed fluctuates be¬ 
tween about 40 and 120 species, which in turn 
poses the question of where these species go. 
As the many hectares of forest adjacent to the 
study area were essentially identical, mass mi¬ 
gration seems unlikely. Second, if the 30 spe¬ 
cies do remain in the hectare, why is the ob¬ 
served richness not more consistent from 
month to month? Only two possible explana¬ 
tions seem likely. First, they may vertically 
migrate into canopy strata that neither inven¬ 
tory accessed. Nielsen (1974c, 1987) found 
that portions of the arthropod fauna do mi¬ 
grate up and down at this site several times a 
year. Second, for some portion of the year 


Figure 15.—Phenology for the 66 species observed in the ZMUC inventory, based on AAU data. Thin 
horizontal lines give the range during which adults were found (gaps not indicated). Grey indicate peak 
adult abundances, if present. Squares mark the ZMUC inventory. 
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Figure 16.—Curves for observed richness, rare species, and richness estimators for the ZMUC data 
pruned of artifactually rare species (see text). 


these species certainly exist only as eggs or 
may hide in retreats not accessible to the col¬ 
lecting methods. Thus, the ecologically mean¬ 
ingful late summer spider fauna is probably 
close to the Chao estimates in Fig. 13—about 
80 species. 

The AAU and ZMUC studies are best com¬ 
pared on the basis of adults only. For the same 
time period the AAU sampling intensity was 
much lower than the ZMUC study (27 vs. 
132), but percent singletons was roughly com¬ 
parable (22 vs. 29%). In terms of specimens 
collected, the ZMUC study was five times 
more intense than the AAU effort. The extra 
ZMUC effort netted about 20 more species, of 
which 18 were singletons or doubletons—ex¬ 
actly the sort of species the less intense AAU 
sampling effort would be expected to miss. 

Both Figs. 13 & 14 suggest that the ZMUC 
inventory sampled more spiders than neces¬ 
sary to estimate richness. Perhaps 3—4,000 
adults sampled would have been sufficient to 
estimate species richness, given that the rank- 
abundance distribution was heavily skewed 
towards a few extremely abundant species. 
This depends on collectors being able to rec¬ 


ognize accurately the very abundant species 
in the field. 

The heavy reliance on hand-searching dur¬ 
ing the ZMUC study did not yield significant 
numbers of species that were missed by the 
AAU study’s reliance on methods less depen¬ 
dent on collector experience. Collector expe¬ 
rience does significantly improve the number 
of species taken per sample, and, usually, ex¬ 
perienced collectors do not differ among 
themselves (see also Sprensen et al. 2002). 
The collector in Fig. 4 that was classified a 
priori as experienced did not actually have any 
experience with this sort of sampling, al¬ 
though he had reportedly collected spiders for 
many years. How much time a naive collector 
requires to become “experienced” is still an 
open question. Collector experience has only 
a minor and insignificant effect on numbers of 
animals. In particular, the maximum number 
of animals caught per hour (273) greatly ex¬ 
ceeds the average (58), which means that ob¬ 
served sample abundances are not limited 
simply by how fast collectors can collect. 
Granted that that human collectors as a sam¬ 
pling method will have its own intrinsic bias 
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(as do ail sampling methods), variation in ob¬ 
served abundances probably does reflect gross 
differences in true relative abundance In the 
field. Experienced collectors, at any rate, do 
not catch more species because they catch 
more animals; the reason is probably that they 
know more places to look in order to find spi¬ 
ders. 

The extreme ecological dominance of L. tri¬ 
angularis and D. socialis made this inventory 
less complete than it otherwise would have 
been. Nearly 50% of the animals collected— 
quite a practical measure of sampling (and 
sorting) effort—disappeared into the arguably 
useless exercise of collecting superfluous 
specimens. In fact these two species illustrate 
the extremes of the effects that extreme eco¬ 
logical dominance can have. After one night 
we truncated collection of D. socialis because 
it was very abundant and easily recognizable. 
Human collectors can do this. If we had used 
automated ecological traps, a very great many 
more D, socialis would have died. At the oth¬ 
er extreme, two rare species look enough like 
L. triangularis in the field that one cannot re¬ 
liably distinguish them. Therefore we contin¬ 
ued to collect “L. triangularis and eventu¬ 
ally collected one L. hortensis late in the 
survey. We sacrificed accuracy of the relative 
abundance of common species to focus on 
rare species and to moderate our effect on the 
fauna. Still, the superabundance of a few spe¬ 
cies may make it hard for collectors to collect 
the remainder in an unbiased way. The most 
abundant spider species in tropical ecosystems 
rarely exceed 15% of the total (Coddington et 
aL 1991, 1996; Silva & Coddington 1996; Sil¬ 
va 1996), and that seems mainly to occur at 
high elevations (Sprensen et al. 2002). Very 
common species actually may make temperate 
ecosystems more difficult to survey in some 
ways than tropical systems. 

As expected, collection method and time of 
day also influence results (Figs. 5-6). Not 
only are some methods more productive, all 
methods seem to access different sampling 
universes (Table 1), which justifies the broad¬ 
est possible spectrum of collecting methods in 
faunal inventories that aim to be complete. 

Sampling methods access different compo¬ 
nents of the fauna. Equal effort among meth¬ 
ods implicitly assumes that all methods are 
equally efficient, and that the sampling uni¬ 
verses particular to each method are roughly 


the same size. These assumptions are clearly 
unrealistic, and thus to minimize sampling 
bias, inventories should differentially allocate 
effort among methods, if a goal of the inven¬ 
tory is to sample the community with as little 
bias as possible. We suggest that inventory 
completion is a reasonable, albeit imperfect, 
statistic to measure this bias. It implies that 
the optimal allocation strategy would yield 
similar inventory completion measures for all 
inventory partitions, whether by method, time 
of day, or other partitions. Thus, all partitions 
might be undersampled, but they would be, In 
some sense, ‘'equally 5 ’ under-sampled. The 
ZMUC study emphasized cryptic and ground 
searching in anticipation of large numbers of 
ground-dwelling linyphiid species. The results 
suggest that the cryptic fauna was relatively 
over-sampled, and the ground fauna relatively 
undersampled, which in turn suggests that the 
sample of the overall spider community we 
obtained is biased in particular ways, although 
not as much as It would have been had the 
sampling allocation been more nearly uni¬ 
form. One could, for example, calculate rich¬ 
ness for various taxa and assess how 7 well the 
methods sampled those taxa. If the sampling 
regime had lasted more days, litter and pitfall 
samples could have been added without di¬ 
minishing the amount of time for collector- 
based sampling. We certainly support using as 
many techniques as resources permit. Allo¬ 
cation of sampling effort across methods is a 
serious problem. Although ideally the sample 
should reflect the parametric community, and 
in theory richness estimators should identify 
departures from that ideal, we do not know if 
the inventory completion statistics are robust 
from one study or region to another. If a given 
investment in, say, sweeping, produces wildly 
different and unpredictable results from place 
to place, year-to-year in the same place, or 
study-to-study, it will not be a useful analyt¬ 
ical technique. 

Grossly different numbers of samples be¬ 
tween methods or times of day inevitably pro¬ 
duce highly unbalanced statistical designs for 
analysis of variance. However, the natural his¬ 
tory logic of investing more in productive as 
opposed to less productive methods in our 
view outweighs the analytical convenience of 
a completely balanced design. First, the sta¬ 
tistical differences we detected in this study 
are large (P < 0.00) and are unlikely to dis- 
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appear in a balanced design. Second, modern 
statistical packages can correct much better 
for unbalanced designs than formerly. Third, 
if necessary one can include only the first N 
samples in each analysis of variance cell, 
where N is the global minimum cell size. This 
provides an unambiguous way to test sam¬ 
pling effects while still freeing the investiga¬ 
tor to allocate sampling effort in the way best 
calculated to access efficiently and accurately 
the total fauna. 

This study demonstrates design and analyt¬ 
ical methods by which undersampling bias in 
terrestrial arthropod surveys can be detected 
and measured. The evidence for severe un¬ 
dersampling bias in arthropod surveys is per¬ 
vasive if measured by percent singletons. 
Large samples do not indicate a thorough in¬ 
ventory if the inventory scope was broad. In 
spiders, for example, the fogging of the can¬ 
opy of a single tree from Manu National Park, 
Peru by T. L. Erwin yielded 222 adult spiders 
of 124 species, 63% of which were singletons, 
and multiple tree canopies from Tambopata, 
Peru, yielded 1,821 adult spiders of 645 spe¬ 
cies, 55% of which were singletons (Codding- 
ton, unpublished data). Silva (1996) reported 
43% singletons in a collection of 5,895 adults 
of 1,140 species from Samiria, Peru, collected 
mostly by fogging. A recent spider canopy 
study from Tanzania had 23% singletons 
(Sprenson 2003). Other authors often report 
diversity statistics for fogging samples rather 
than raw numbers, but because Fisher’s alpha 
approximates the number of singletons, Rus¬ 
sell Smith and Stork (1994) must have found 
an average of about 45% singleton spider spe¬ 
cies in fogging samples at four stations along 
an elevational transect in Sulawesi. Subcano¬ 
py manual collecting in Manu yielded 2,616 
adults of 498 species with 42% singletons 
(Silva and Coddington 1996). Three points 
along an elevational transect in Bolivia aver¬ 
aged 44% singletons in subcanopy faunas 
(Coddington et al. 1991, 1996). Kuntner and 
Baxter (1997) found 54% singletons in sub¬ 
canopy collections in Slovenia. Singleton per¬ 
centages for spider inventories are not out of 
line with terrestrial arthropods generally. No¬ 
votny and Basset (2000) collected over 80,000 
homopterans, but these comprised over 1,000 
species, of which 27% were singletons. Toft 
(1976) and this study together collected 
50,983 spiders of 146 species, but again 27% 


were singletons. Basset et ak (1996) collected 
4,696 individuals of 391 species of beetles, 
and percent singletons was 39% (Basset 
1997). Basset and Kitching (1991) collected 
20,500 individuals of 759 subcanopy and can¬ 
opy species but 36% were singletons. The 
canopy fraction was higher at 45%; among 
spiders it was 42%. Allison et al. (1997) sam¬ 
pled 3,977 individuals of 481 species of bee¬ 
tles, but 46% were singletons. Erwin (1997) 
reports collecting 15,869 Peruvian beetles of 
3,429 species of which 50% were singletons. 
Janzen and Schoener (1968) reported 65% 
singletons In their arthropod collections from 
all of Costa Rica, and Noyes (1989) reported 
60% in Chalcidoidea from Sulawesi. Monteith 
and Davies (1984) likewise found 40% sin¬ 
gletons during a month-long survey of 
Queensland rainforest. In relative abundance 
distributions such as these doubletons are very 
probably about half the singletons, so some¬ 
thing like 50-70% of the species found in 
many 64 state of the art” arthropod surveys are 
known from two or fewer individuals. 

This study was able to explain only about 
a third of the rare species as artifacts of one 
sort or another. The remainders imply that 
even after intense sampling, observed richness 
understated true richness by at least 20%. The 
relative abundances of the species found by 
ZMUC and missed by AAU for August-Sep- 
tember is consistent with the hypothesis that 
most “rare” species (singletons) in terrestrial 
arthropod surveys are legitimate members of 
the community. The tendency to ignore rare 
species as “tourists” should be viewed with 
skepticism (Stork & Samways 1995). These 
estimates formally are all lower bounds (Bun¬ 
ge & Fitzpatrick 1993; Colwell & Coddington 
1994), so the actual situation is probably 
worse. Figures 12, 13 and 16 show that in 
practice estimators’ asymptote only when 
about two-thirds or more of the species are 
already observed. In sum, both statistical spe¬ 
cies richness estimators and the observed rich¬ 
ness are negatively biased with respect to 
parametric community richness for most of 
the time course of an inventory. Richness es¬ 
timates statistically corrected for undersam¬ 
pling bias are nevertheless more accurate than 
the raw, observed richness, and, depending on 
the degree of accuracy required, probably al¬ 
most never show significant positive bias in 
practice. 
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Because return-on-effort in inventories is 
inevitably curvilinear, direct comparisons of 
richness values between sites are likely to be 
fraught with bias and error (Gotelli & Colwell 
2001). This study also shows that for the first 
2,000 or so specimens as a measure of effort, 
even the most aggressive richness estimators 
are still strongly and negatively biased. Con¬ 
sidering that a sample of 2,000 animals from 
a parametric community richness of about 80 
species still represents a sampling intensity of 
25, and that sampling intensities of less than 
10 are probably typical of most work, one 
must question the prevailing paradigm of 
spreading arthropod inventory resources as 
thinly as possible in pursuit of broad goals and 
diverse taxa. It is the rare terrestrial arthropod 
inventory taxon that does not have twice, or 
even 10 times the anticipated diversity as any 
sympatric vertebrate group, and arthropod sur¬ 
veys generally make do with less resource 
than vertebrate surveys. Masters theses that 
envisage a single student sampling a diverse 
taxon once or twice a month over an annual 
cycle in a seasonal environment is almost cer¬ 
tain to result in data so sparse that absence 
due to undersampling bias will be indistin¬ 
guishable from that due to biologically inter¬ 
esting variation (McArdle & Gaston 1993). Of 
course, not all surveys aim to measure or es¬ 
timate richness, but comparative species rich¬ 
ness is increasingly the most important datum, 
at least initially, in biodiversity conservation 
(Mittermeier et al. 1998). Nevertheless, until 
very recently, manuals and treatments of in¬ 
ventory methods rarely mention undersam¬ 
pling bias (Hayek & Buzas 1997; Stork & 
Sam ways 1995), but see (Leitner & Turner 
2001). Compared to the initial costs of mount¬ 
ing the survey to begin with, designing and 
funding it well enough to secure verifiably re¬ 
liable data seems at most a marginal cost in¬ 
crease. If the conservation of biodiversity de¬ 
pends on reliable data, both funding agencies 
and the designers of inventory protocols 
should reconsider prevailing practices. 
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